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Positive-stranded genomic RNA of coronavirus MHV and its 
six subgenomic mRNAs are synthesized in the cytoplasm of 
the host cell. The mRNAs are composed of leader and body 
sequences which are non-contiguous on the genome and are 
fused together in the cytoplasm by a mechanism which ap- 
pears to involve an unusual and specific ‘polymerase jump- 
ing’ event. 
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Introduction 


RNA viruses display a variety of replication strategies but 
share many biosynthetic pathways with the host cell. They are 
therefore excellent probes for the investigation of cell pro- 
cesses and their study has played an important role in the 
discovery of several aspects of gene expression in animal cells. 
To date, these studies have concentrated on a relatively small 
number of viruses which were particularly suited to biochemi- 
cal analyses, but there are also a number of RNA virus 
families for which the replication strategy is almost, or com- 
pletely, unknown. In spite of their pathogenetic properties, 
and the resulting economic losses, coronaviruses were, until 
recently, such a group. However, in the last few years many 
basic aspects of coronavirus replication have been elucidated 
(for reviews, see Siddell et a/., 1983; Sturman and Holmes, 
1983). 

Coronaviruses are enveloped positive-stranded RNA 
viruses. Their genome is linear, unsegmented and 
15 000 —20 000 bases in length. The most studied member of 
the group is murine hepatitis virus (MHV). MHV replicates in 
the cell cytoplasm and viral genetic information is expressed 
in infected cells as one genomic sized and multiple sub- 
' genomic mRNAs. These mRNAs are synthesized in non- 
equimolar amounts, but in relatively constant proportions 
throughout infection (Leibowitz et a/., 1981). The template 
for viral MRNA synthesis is a genomic length negative strand 
(Lai et al., 1982a). Analysis of virion and mRNA of MHV- 
A59 by two independent methods reveals that the viral 
mRNAs have a ‘nested set’ structure with 3’ -co-terminal ends 
and sequences extending for different lengths in a 5’ direction 
(Leibowitz et al., 1981; Lai et al., 1981, 1982b; Spaan et al., 
1982; Cheley ef al., 1981). Each subgenomic mRNA is capped 
and polyadenylated, as is gnome RNA (Lai et al., 1982b), 
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and is translated independently to produce a single protein the 
size of which corresponds to the coding capacity of the 5’ se- 
quences not found in the next smallest mRNA (Siddell, 1983; 
Rottier et al., 1981; Leibowitz et a/. 1982). There is at present 
no indication that the replication of MHV involves a nuclear 
phase or nuclear factors. MHV is reported to grow in 
enucleated cells and its replication is not inhibited by actino- 
mycin D or alpha-amanitin (Brayton ef a/., 1981; Wilhelmsen 
et al., 1981; Mahy ef al., 1983). Also, the synthesis of each 
mRNA is inactivated by u.v. irradiation in proportion to its 
own length (Jacobs et al., 1981). Thus the subgenomic 
mRNAs are not produced by the processing of larger RNAs. 

Analysis of oligonucleotides generated by RNase T1 diges- 
tion of viral RNA reveals unique oligonucleotides that do not 
fit into the ‘nested set’ structure. T1 oligonucleotides 10 and 
19 from mRNA7 of MHV-AS9 (using the nomenclature of 
Lai et al., 1982b) are not present in the corresponding 3’ end 
of the genome (Spaan ef al., 1982; Lai et al., 1983), sug- 
gesting that these oligonucleotides are derived from a leader 
sequence which all mRNAs might share. Oligonucleotide 10 is 
also found in the larger mRNAs and genome RNA (Lai et ai., 
1983) but mRNA-specific differences in electrophoretic 
mobility have been detected for oligonucleotide 19. It is pre- 
sent as oligonucleotide 19, 19a and 3a in mMRNA7, mRNA6 
and mRNAS, respectively. Oligonucleotides 19 and 19a have 
very similar base compositions. Oligonucleotide 17, which is 
found in mRNA6 and larger mRNAs, but not in mRNA7 
(Lai et al. 1982b, 1983; Spaan et al., 1982), also has a base 
composition similar to oligonucleotides 19 and 19a (Arm- 
strong ef al., 1983). Finally, the mRNAs share at least five 
nucleotides at their S' end (Lai et al., 1982b). 

These data can be interpreted in the model shown in Figure 
1. In this model, sequences present at the 5’ end of genome 
RNA, or mRNAI, are also found at the 5’ end of each sub- 
genomic mRNA (these sequences will be referred to as 
leaders). MHV-AS59 oligonucleotide 10 would be encompass- 
ed within these sequences. Part of the MHV-AS59 oligonuc- 
leotides 19 and 19a would also be contained within the leader, 
but differences would arise from fusion of the leader se- 
quence with the various bodies of the mRNAs. Oligonucleo- 
tide 17 would represent sequences at the 5’ end of the 
mRNA7 body, part of which would be lost during the con- 
struction of mRNA7 but not for example mRNA6. 

In the experiments described here we have investigated the 
structural organization of MHV mRNA by electron micro- 
scopic analysis of hybrids formed between single-stranded 
cDNA copies from mRNA7 with genomic RNA or mRNA6. 
In addition, we have determined the nucleotide sequence at 
the 5’ end of MHV-A59 mRNA7 and the corresponding 
region of the genome. The 5S’ sequences of MHV-JHM 
mRNA7 have also been determined. The MHV-AS9 and the 
MHV-JHM strains have unrelated RNase T1 fingerprints 
(Lai and Stohlman, 1981), but the recognition sites for the fu- 
sion of the leader sequence and the mRNA body might be ex- 
pected to be conserved between strains as well as in different 
mRNAs of the same strain. 
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Fig. 1. A model for the structural relationships of MHV-A59 genome RNA and mRNA6 and 7. The symbols @ and A, represent 5‘-terminal cap structures 
and 3'-terminal polyadenylate tracts, respectively. RNase T1-resistant oligonucleotides are identified by numbers (Lai ef a/., 1982b). The sequences X and Y 
are not yet identified. El (matrix protein) and N (nucleocapsid protein) are the translation products of mRNA6 and mRNA7, respectively. The boxed regions 


are represented on a larger scale than the other regions of the mRNAs. 


Results 


Electron microscopy of hybrids between single-stranded 
cDNA copied from mRNA 7 with genome RNA or mRNA6 


Single-stranded cDNA was prepared as described (Arm- 
strong ef al., 1983) and after hydrolysis of the RNA template 
it was treated with glyoxal and dimethylsulfoxide and analyz- 
ed by agarose gel electrophoresis. A main band approxi- 
mately the size of mRNA7 was identified (data not shown). 
This cDNA was annealed to genome RNA and prepared for 
electron microscopy by cytochrome spreading. A sequence 
homology between the 5’ end of the RNA copied into cDNA, 
and the 5’ end of the genome RNA should lead to the forma- 
tion of a looped hybrid and indeed such structures were 
observed. Figure 2A shows such a hybrid molecule accom- 
panied by a tracing outlining the possible arrangement of the 
RNA and DNA strands. Only linear molecules were observed 
when preparations of RNA alone were analysed in the same 
way and circularization, therefore, has to be attributed to the 
hybrid formation. The length of the hybrid region in these 
molecules corresponds to 1995 + 160 bp, using PM2 DNA 
as a standard and after correction for the shortened hybrid 
length (Priess et al., 1980). The size of the loop was determin- 
ed as 19.4 + 1.0 kb. This value is an approximation due to 
the lack of a suitable RNA standard of this size. No double- 
strand could be discerned at the point of the re-entry of the 
genome RNA into the hybrid near the 5’ end. This excludes a 
double-stranded region much larger than 50 nucleotides. 
Figure 2B shows a hybrid between the same cDNA and 
mRNA6. Again, a double-stranded region (1890 + 140 bp) 
caused by hybridization between the cDNA and the 3’ end of 
mRNA6, and a single-stranded loop structure (in this case of 
600 + 80 nucleotides) were observed. This single-stranded 
loop probably represents the El gene. Its size would be suffi- 
cient for it to encode a polypeptide of 22 000 + 2900 mol. 
wt., the approximate size of the non-glycosylated form of 
polypeptide E1 found in MHV-AS59-infected cells (Rottier et 
al., 1981). The most likely explanation of the loop formation 
in these hybrids is once again the presence of common leader 
sequences in mRNA6 and mRNA7, although again no hybrid 
stretch on the 5’ side of the loop could be detected. The data 
strongly support the model shown in Figure 1. 


Sequence analysis of mRNA7 and the region of the genome 
between the El and N genes 


A further prediction of the model shown in Figure 1 is that 
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the nucleotide sequence of the region immediately upstream 
from the nucleocapsid gene of mRNA7 should diverge from 
the region upstream from the nucleocapsid gene in the 
genome. Therefore, we determined the nucleotide sequence in 
these regions by two independent procedures. Firstly, two 
recombinant DNA clones were analysed. One clone, MS38, 
was obtained from mRNA7 of MHV-JHM. An outline of the 
procedures to obtain and sequence this clone is given in 
Materials and methods and full details will be given elsewhere 
(Skinner and Siddell, 1983). This clone corresponds to the 5’ 
half of mRNA7. Two large T1 oligonucleotides are predicted 
from this sequence. One oligonucleotide (position — 53/ — 29) 
would have a base composition very similar to that reported 
for MHV-AS9 oligonucleotide 10, the other (position 
— 25/ —2) was similar to oligonucleotide 19 (Figure 3, the dif- 
ferences will be discussed below). At position 1 an AUG 
codon is found that generally conforms to a consensus for in- 
tiation sequences (Kozak, 1981), particularly in the occur- 
rence of an A residue at position — 3. It is followed by an ex- 
tensive open reading frame (Skinner and Siddell, 1983). 

The second recombinant DNA cloned and sequenced was 
prepared from a mixture of MHV-A59 mRNAs. Clone S9 
covered sequences upstream from the initiation codon 
preceding a sequence of 1664 nucleotides at the 3’ end of the 
genome, including the nucleocapsid gene (Armstrong et al., 
1983) (Figure 4). Clone S9 started to diverge (reading 3’ to 
5’) from clone MS38 at position —22, exactly at the same 
point as oligonucleotides 17 and 19 diverge from each other, 
when ordered for maximal homology with the DNA se- 
quences (Figure 3). These data therefore support the model of 
Figure 1, and imply that clone S9, which lacked sequences 
corresponding to oligonucleotide 10, represents sequences not 
found in mRNA7, but present in other mRNA species and 
genome RNA (see below). 

The second approach we have used was to sequence direct- 
ly the 5’ end of mRNA7 and the corresponding region of the 
genome of MHV-AS59 using DNA primers and reverse 
transcriptase for dideoxy sequencing on RNA (McReynolds 
et al., 1978; Zimmern and Kaesberg, 1978; Zain and Roberts, 
1979). The primers used were a synthetic primer complemen- 
tary to positions 11 —24 (New England Biolabs) and a restric- 
tion fragment isolated from the RF DNA of clone S38 (Figure 
4) after digestion with restriction endonuclease Hpall. A 
single-stranded primer was obtained by treating the fragment 
with exonuclease III prior to hybridisation (Zain and Roberts, 
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Fig. 3. Comparison of nucleotide sequences of the 5’ end of MHV mRNA7 and the homologous part of genome RNA and the alignment with T1 
oligonucleotides. The sequences were obtained as described in Materials and methods. The alignment of T1 oligonucleotides with the other sequences was 
made by maximizing homology, using the base compositions determined by Lai ef al. (1982b). Homologies between adjacent sequences are indicated by 


asterisks. 
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Fig. 4. Localization of DNA clones of MHV used to determine the se- 
quences upstream from the gene encoding the nucleocapsid protein (N). 
The physical map of the sequence of the nucleocapsid gene is shown. Pro- 
tein (N) is translated from position 0 to 1362. Sequence divergence between 
the 5‘ end of mRNA7 and the corresponding region of the genome (Figure 
3) is indicated in MS38. 


origin of clone F10 which previously was suspected to repre- 
sent the genome of MHV-AS59 (Armstrong ef al., 1983) re- 
mains to be determined. 


Localization of the fusion site of leader and body sequences 
of MHV mRNAs 


From the above data, summarized in Figure 5, it is clear 
that the 5’ terminus of the MHV-A59 mRNA body cannot 
extend beyond position -—21, the site of divergence of 
mRNA7 and the genome. As mentioned before, T1 oligonuc- 
leotide 19a (specific for RNA6) and T1 oligonucleotide 19 
(specific for RNA7) have a very similar base composition. By 
predicting the sequence of 19a from the base composition 
(Lai et al., 1982b) and by comparison with the MHV-AS59 se- 
quence between — 24 and — 2 (Figure 5) it can be seen that the 
3’ terminus of the leader sequence fused to mRNA6 cannot 
extend beyond the first base difference (reading in a 5’ to 3’ 
direction) in oligonucleotide 19 and 19a. Thus the data sug- 
gest that for MHV-AS59 the fusion of leader and body se- 
quences producing oligonucleotides 19 and 19a occurred 
within the sequence 5’ AAUCUAAUCUAAACLU 3’, a se- 
quence that does not contain the consensus (A/C)AG/G 
established for splice junctions in viral and cellular mRNAs 
(Flint, 1981). It was interesting that the only major difference 
between A59 and JHM virus found in the 5’ -terminal region 
was within this fusion sequence. The repeated palindromic 
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pentamer AUCUA (or the sequence AAUCU) might function 
as a recognition signal during the fusion process. 


Discussion 
How are the leader and body sequences of coronaviral 
mRNAs fused? 

Previous studies have demonstrated that MHV sub- 
genomic mRNAs are not produced by the processing of larger 
RNAs and that conventional splicing mechanisms are not in- 
volved in MHV mRNA synthesis (Brayton ef a/., 1981; Wil- 
helmsen ef al., 1981; Mahy ef al., 1983; Jacobs eft al., 1981). 
The data presented here could, however, be explained by a 
polymerase jumping mechanism. Translocations of RNA 
viral polymerases have been postulated before to explain the 
generation of mutant defective interfering (DI) particles (Per- 
rault, 1981; Lazzarini et a/., 1981; Fields and Winter, 1982). 
In the case of MHV, however, the mechanisms would be part 
of the normal pathway of viral replication, and thus would be 
the first example of specific fusion of long non-contiguous 
RNA sequences in the cytoplasm. The mechanism we pro- 
pose for the generation of coronavirus mRNA would involve 
the synthesis of a short RNA transcript from the 3’ end of the 
negative stranded template. The polymerase/leader complex 
would then be translocated to an internal position on the 
negative stranded template where transcription would 
resume. This translocation may or may not involve dissocia- 
tion of the polymerase/leader complex from the template. 
Translocation would have to occur to specific positions and 
at specific frequencies. It seems likely that this specificity 
would be, at least in part, related to sequences in the 5’ non- 
coding region of MHV RNAs. We have argued that the fu- 
sion of leader and body sequences during the synthesis of 
mRNA7 and mRNA6 occurs within the sequence 5’ 
AAUCUAAUCUAACLU 3’. This sequence or its 5’ end 
might also be present within region X of the leader (Figure 1) 
creating a donor and acceptor site sequence homology. 

This mechanism superficially resembles that proposed for 
the formation of DI influenza virus RNA. Fields and Winter 
(1982) suggested that after the termination of transcription in 
a U-rich region of influenza genomic RNA, sequences at the 
3’ terminus of the nascent chain can base pair to sites down- 
stream on the template. A similar mechanism might also ex- 
plain the high frequency of recombination between 
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Fig. 5. Localization of the fusion site between body and leader of MHV mRNA. For details set text. 


aphthovirus RNAs (King ef a/., 1982). Therefore, the syn- 
thesis of MHV mRNAs could be considered as a refinement 
of a mechanism which occurs in the generation of mutant 
forms of other RNA viruses. The distinguishing features of 
the mechanism in coronavirus infection are that it is highly ef- 
ficient and specific, being the normal route for viral replica- 
tion. Clearly, further sequence analyses of the extreme 5’ end 
of the MHV genome and of more intergenic regions are need- 
ed to elucidate the details of this unusual mechanism of RNA 
synthesis. 


Materials and methods 


Viruses and cells 


The growth of coronavirus MHV in Sac({—) cells has been described 
previously (Siddell et a/., 1980; Spaan et al., 1981). 


Electron microscopy 


The cDNA was incubated with RNA at a concentration of ~ 1 ng/ml in 
50% formamide containing 10 mM Tris-HCl, 1 mM EDTA and 0.2 M CsCl 
for 30 min at 40°C. A 10-fold diluted aliquot was spread from 30% form- 
amide, 0.1 M Tris-HCl, 1 mM EDTA, 0.1 pg/ml PM2 DNA with CNBr- 
treated cytochrome (Delius ef a/., 1972) on a hypophase of distilled water. 
Samples were picked up on Parlodion-coated grids, stained with uranyl 
acetate and rotary shadowed with platinum. 


cDNA cloning and sequencing 


The procedures used to clone and sequence the 5’ region of MHV-JHM 
mRNA7 were briefly as follows. Polyadenylated RNA from MHV-JHM in- 
fected Sac(—) cells was used as a template for avian myeloblastosis virus 
reverse transcriptase in a reaction primed with oligo(dT). After alkali 
hydrolysis of RNA, single-stranded cDNA was tailed with oligo(dC) using ter- 
minal transferase. Second strand synthesis was primed with oligo(dG). 
Double-stranded cDNA was tailed with oligo(dC) and annealed to the plasmid 
pAT153 which had been linearized with Ps¢I and tailed with oligo(dG). After 
transformation with this DNA, Escherichia coli HB101 colonies with an 
Am‘/Tc’ phenotype were screened for coronavirus sequence inserts by 
hybridization to single-stranded [*4P]cDNA copied from virion RNA. One 
clone isolated by this procedure, MS38, was further characterized by Nor- 
thern blotting and restriction enzyme mapping. After subcloning into M13, 
the inserted DNA was sequenced by the chain terination method of Sanger et 
al. (1977). The procedures used to obtain and sequence MHV AS9 clone S9 
have been described in detail (Armstrong ef a/., 1983). 


Sequencing of RNA with DNA primers and reverse transcriptase 


Clone S83 (Armstrong ef al., 1983; Figure 4) was used to isolate a DNA 
primer for sequencing directly on the viral RNA. Replicative-form DNA was 
isolated from a 21 culture of S83-infected bacteria, by Triton lysis and 
equilibrium centrifugation in CsCl. 50 ug was digested with 60 units of endo- 
nuclease Hpall (Bethesda Research Laboratories) for 1 h at 37°C in a volume 
of 100 pl. After the digestion, the DNA was treated with bacterial alkaline 
phosphatase (P-L Biochemicals Inc.), labelled using [#7PJATP (3000 Ci/ 
mmol, Radiochemical Centre, Amersham) and T4 polynucleotide kinase 
(Bethesda Research Labs.) and purified by gel electrophoresis using standard 
procedures. The fragment was eluted and purified on a DEAE column 
(Smith, 1980) and dissolved in 50 yl 10 mM Tris-HCl pH 8.0, 1 mM EDTA. 


10 pl of the Hpall fragment were treated with 2 units of exonuclease III 
(Bethesda Research Labs.) in SO mM Tris-HCI pH 8.0, 5 mM MgCl, 10 mM 
dithiothreitol (DTT) for 30 min at 37°C in a volume of 20 pl. The mixture 
was boiled for 3 min, quickly cooled in a dry-ice ethanol bath and 1 yl of 
RNA7 or genomic RNA (1 mg/ml), 4 1 10-fold concentrated reverse 
transcriptase buffer (1x buffer is 50 mM Tris-HCl pH 8.3, SO mM KCI and 
8 mM MgCl.) and 5 nl H,O were added. Hybridization was carried out for 
15 min at 68°C. To 7 pl of this primed mRNA, in a 10 pl reaction volume, 
1 pl of 10 mM DTT, 4 Ci [?2PJdCTP (400 Ci/mmol, Radiochemical Centre, 
Amersham), 3 units reverse transcriptase (Life Sciences Inc.) and 1 yl of one 
of the following chain-terminating nucleotide stock mixtures were added; ddA 
mixture: 50 nM dATP, 100 nM dGTP, 100 uM dTTP and 5 pM ddATP: 
ddC mixture; 250 nM dATP, 250 uM dGTP, 250 uM dTTP and 2.5 pM 
ddCTP: ddG mixture; 100 nM dATP, 50 »M dGTP, 100 »M dTTP and 
5 uM ddGTP: ddT mixture; 250 .M dATP, 250 »M dGTP, 250 nM dTTP 
and 25 »M ddTTP. Incubation was at 42°C for 30 min and | pl of a0.5 mM 
solution of unlabeled deoxynucleoside triphosphate was then added to chase 
for an additional 0.5 h. The reaction products were analyzed on 0.25 mm 
thick and 40 cm long 6% polyacrylamide gels (Garoff and Ansorge, 1981). 
Direct sequencing with the synthetic primer was performed using the same 
protocol. 
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